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. Considerable insight into the functional activity of proteins and enzymes can be obtained by 

' studying the low-energy conformational distortions that the biopolymer can sustain. We carry out 

' the characterization of these large scale structural changes for a protein of considerable pharmaceu- 

04 , tical interest, the human /3-secretase. Starting from the crystallographic structure of the protein, we 

(— I . use the recently introduced /3-Gaussian model to identify, with negligible computational expendi- 

' ture, the most significant distortion occurring in thermal equilibrium and the associated time scales. 

The application of this strategy allows to gain considerable insight into the putative functional 
QQ . movements and, furthermore, helps to identify a handful of key regions in the protein which have 

an important mechanical influence on the enzymatic activity despite being spatially distant from 
the active site. The results obtained within the Gaussian model are validated through an extensive 
comparison against an all-atom Molecular Dynamics simulation. 
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Introduction 



In the past decades several experimental and theoretical studies have pointed out that, although proteins possess 
an atomic density comparable to that of crystalline solids, they are much more flexible that the latter Jj, ,2i, i4li This 
■ O ' unusual elasticity, whose origin putatively resides in the neat secondary and tertiary protein organization is a 
^ ■ prerequisite for biological functionality. In fact, in order to carry out their biological tasks, proteins and enzymes 
need to sustain conformational distortions where groups of several amino acids are significantly displaced from the 
reference native states. The time scales associated to such configurational changes, which may occur over several 
nanoseconds, is also long compared to those of atomic motions 0- 

From a computational point of view, the most direct way of observing these rearrangements would be to resort to a 
Molecular Dynamics simulation Present implementations of this scheme allow to follow the dynamical evolution 
of a large protein (of a few hundred residues) in its surrounding solvent for about one nanosecond. This time scales 
is large enough to gain considerable insight into several dynamical aspects of large proteins but may be inadequate 
7-H to characterize accurately the functional movements mentioned before Q. 

Several studies have attempted to bridge the gap between the time scales of feasible MD simulations and the ones 
, of biologically-relevant protein movements by recursing to a mesoscopic rather than a microscopic approach Q . A key 
' contribution in this framework has been the observation that the over-damped dynamics of a protein in its solvent 
can be described as occurring in an effective quadratic potential |^, 0, Ull ll^ [l^ . This observation was further 
supported by Tirion who pointed out that, in a normal mode analysis of protein vibrations, the complicated classical 
force- field could be replaced by harmonic couplings with the same spring constants Q. These results stimulated a 
, variety of studies where the elastic properties of proteins have been described by means of coarse-grained models where 
r-| ' amino acids are replaced by effective centroids (corresponding to the Cq. and/or Cp atoms) and the energy function is 
Q , reduced to harmonic couplings between pairs of spatially close centroids. These approaches have been found to be in 
O ' accord with both experimental and MD characterization of the overall protein elastic behaviour [l^ Hsl UtI IT^ . 
In particular, the recently-introduced /?-Gaussian modelhas been shown to be quite effective in identifying, with a 
modest computational effort, the most important conformational changes that a protein can undergo in thermal 
rS I equilibrium [l9j. These vibrational modes are here investigated for a protein, the human /9-secretase (denoted as 
^ I BACE hereafter) which is an important representative of the pepsins, a family of enzymes which are capable of 
■ " " '■ cleaving a peptidic substrate through a chemical reaction (hydrolysis) involving an aspartic dyad. It is important 
to point out that a distinctive feature of this enzyme is the fact that the two aspartates are located on opposites 
sides of the interface between the two lobes. Recently, the BACE, a membrane-anchored extracellular protein, has 
been the subject of several experimental investigations since it constitutes a key-target for drugs used in treatments 
of the Alzheimer's disease (AD) m El- The onset of this neuro-degenerative disease is related to the formation 
of /3-amyloid plaques in brain tissues. Recent investigations have demonstrated that the /3-amyloid peptides are the 
product of the cleavage of a precursor protein operated by the BACE. These results have motivated a large body 
of scientific investigations aimed at clarifying the cleavage mechanisms of the BACE and its differences functionality 
with respect to typical members of the pepsin family. Interestingly, from a structural point of view, see Fig. 
the core region of the BACE presents only minor differences from typical pepsin members. Most of the structural 
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FIG. 1: (a) Structure of the human /3-secretase (PDB file Ifkn) complexed with an inhibiting peptide (represented with a 
wire- frame) . The colour highlights the presence of two lobes in the enzyme, (b) The contact matrix, A, associated to the 
native (crystallographic) structure of the enzyme. 



changes are instead located at the surface of the protein, in the form of six loop insertions, and at the C-terminus, by 



As visible in Fig. (Q, where the BACE is shown together with a bound inhibiting peptide, a long cleft of 35 A is 
present at the interface of the two lobes (highlighted with different colours). Interestingly, a very small number of 
residue pairs are found in contact at the lobes interface and they are mostly constituted by the aspartic acids involved 
in the catalytic activity. In analogy with other enzymes 19, 23,01 one may anticipate that also the BACE catalytic 
action depend not only on the favourable chemical interaction of the substrate and the aspartic dyads but also on 
the possibility that the conformational fluctuations of the enzyme itself may module the activity. We shall therefore 
undertake the task of characterizing in detail the elastic response of the BACE by resorting to the /^-Gaussian model. 
An independent term of comparison for the robustness of these findings will be provided by an all-atom Molecular 
Dynamics simulation. It is found a posteriori, that the two methods, which are very different in spirit, provide a 
consistent picture for the largest protein rearrangements occurring in thermal equilibrium. Finally, we characterize 
the extent to which the motion of a substrate bound to the enzyme is correlated with the latter. By these means we 
identify a limited number of amino acids in the enzyme which have a strong mechanical bearing of the conformational 
fluctuations of the enzyme despite the lack of spatial proximity. 

The programme which implements the /3-Gaussian model is available, upon request, from the authors. 



Since the main objective is the modelling of the large scale fluctuations in a protein it is convenient to reduce 
the spatial degrees of freedom of the biopolymer through a coarse-graining where a two-particle representation is 
used for each amino acid. Besides the Ca atom, an effective Cjs centroid is employed to capture, in the simplest 
possible way, the sidechain orientation in a given amino acid (except for Gly for which only the Ca atom is retained). 
This reduced structural representation affects the form of the effective Hamiltonian associated to the coarse-grained 
structural representation. Arguably, the simplest energy function for the system can be constructed by assuming that 
all centroids in the protein (Cq and/or C3) whose separation is smaller than a given interaction distance, R, interact 
through the same pairwise potential, V [3 . The information of which centroids are in interaction in the native state 
is aptly summarised in the native contact matrix which takes on the values of 1 [0] if the native separation of 

the particles of type X and Y, belonging respectively to residues i and j, is below [above] R. The contact map of the 
BACE considered in this study is shown in Fig. ^ The system energy function evaluated on a trial structure, T, can 
then be written as 




The /3-Gaussian Model 



(1) 



where 



,CA-CA 
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In expressions Q we indicated with dfj^ the actual separation of the particles in the trial structure, F. The indices 
i and j run over all integer values ranging from 1 up to the protein length, N . To account for the protein chain 
connectivity we have introduced in (0) a "backbone" energy term which mimics the higher strength of the bond of 
consecutive amino acids with respect to non-covalent contact interactions. Consistently with previous studies |l9j | we 
have set R equal to 7.5 A and K —1. 

The general pairwise Hamiltonian of eqn is subject to an important requirement since it must guarantee that the 
assigned reference conformation is at the global energy minimum. The customary way to accomplish this IT3 . ll7lll^ 
is to assume that the potentials V'^~^ appearing in ^ attain their global minimum in correspondence of the 
native separation of the ccntroids. For small fluctuations around the native structure, the interaction energy of two 
centroids, i and j, can then be expanded in terms of the deviations from the native distance- vector, . If we indicate 
the deviation vector as , so that the total distance vector is dy = + x^ , we can approximate the pairwise 
interaction as 



CA-CAyr/ iCA-CA 



v{d: 



CB-CBy^^B-CB^ 



(2) 



ij 



(3) 



where fi and v denote the Cartesian components, x, y and z, and k is the second derivative of V at its minimum. The 
expansion of Eq. Q brings about a dramatic simplification of Hamiltonian Q which, in fact acquires a quadratic 
dependence in terms of the deviation vectors, x. However, a further simplification of the energy function can be 
achieved by exploiting the fact that the coordinates of the Ca centroids encode, in an almost unique way, the positions 
of all other atoms in the protein, and hence also of the C/s atoms 25, 2Gj . The construction scheme adopted in the 
/3-Gaussian model (denoted as /3-GM in the following) defines the location of the i-th Cfj through a co-planar version 
of the Park and Levitt construction rule p7| : 



2rcA{i) -rcA{i + l) -rcA{t-l) , . 

|2 rcA[i) - rcA[i + 1) - rcA{i - 1)| 

where I = 3 A. Thus, the degrees of freedom can be reduced to only those of the C^'s. The linear relationship between 
the Ca and C/j coordinates allows to recast the Hamiltonian as follows. 



where cc^^^ is the deviation of i-th Ca along the axis and M. is a, 3Nx3N symmetric matrix. The elastic response 
of the system is uniquely dictated by the eigenvalues and eigenvectors of A4 . 

In thermal equilibrium, each amino acid moves under the action of the quadratic Hamiltonian jSjl subject to a 
viscous friction originating from interactions with the surrounding solvent as well as with the rest of the protein l28l . 
The viscous hindrance of the motion is so important that the protein dynamics becomes severely over-damped pXI 
nil I12I 28|. In this case it seems appropriate to describe the dynamics of the amino acids within a Langevin 
framework j29>.3Q|| 

7i ii,^^) = ~k^Mtj,f_,^Xj^^{t) -l-?7j,^(i) , (6) 

where ji is the viscous friction coefficient and ?7i,;^(i) is a stochastic noise satisfying the relations p9l Isof 

( mAt) > = , ( ?7^^p(t)r7j,.(i') ) = S,, S^, 5{t - t') 2kBT 7, . (7) 
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Within this dynamical framework it is possible to calculate exactly how correlations among the displacements of 
various pairs of residues decay as a function of time. We start by considering the case where the various viscous 
coefficients in eqn (01 taken on the same value, 7. In this case one has ^31]: 

{x,,,{t)x,At + m = ^E^M^e"''^^*' (8) 

where the average ( . ) in eqn |(SJ) denotes the usual canonical thermodynamics average. The vector v' and the scalar 
A; are, respectively, the /-th eigenvector and the /-th eigenvalue of the matrix M ordered in such way that A; < A;+i . 
It should be noted that since the Hamiltonian is invariant under rotations and translations, it will always possess (at 
least) six eigenvectors associated to eigenvalues equal to zero that are obviously excluded from the sum in expression 
and related ones ilBiii22]- The eigenvectors {v'} represent therefore the independent modes of structural relaxation 
in the protein, while the associated decay times are given by 

Of particular interest, for the purpose of identifying the concerted large scale structural fluctuations occurring in 
a protein, is the degree of correlation of pairs of residues at equal times. This information is summarised in the 
covariance matrix, C, whose elements are obtained by Ai = in Ip^ 



Due to the fact that this matrix contains the full three-dimensional information about pair correlations its linear size 
is 3A^. Typically it is important only to quantify the relative degree of correlation of any two residues; in this case 
one can consider the normalised reduced covariance matrix (of linear size N) which is defined as 



(11) 



where a sum is implied over repeated indices. It is important to stress that the elements of the covariance matrix are 
thermodynamic averages reflecting equilibrium properties of the system and, hence, are independent of the friction 
coefficients. Through eqn one sees that the same set of vectors {v'} describing the modes of relaxation also 
describes the independent modes of structural fluctuations in thermal equilibrium. The contribution of each mode 
to the pair correlation is inversely proportional to A/ thus establishing the intuitive result that the modes associated 
to the longest relaxation times are those responsible for the largest structural fluctuations. The equivalence of the 
vectors describing the modes of relaxation and structural fluctuations is a consequence of having used the same viscous 
coefficient for all amino acids in the protein. When this simplifying assumption is not made one has that the modes 
and times of relaxation are determined by diagonalizing instead the symmetric matrix 



M.,,,,,u = ^^^^ ■ (12) 

On the other hand, the modes of structural fluctuation are the same as before, as can be ascertained by calculating 
directly the thermodynamic average {xi^^Xj^v) with the canonical weight associated to Hamiltonian 



I. CONFORMATIONAL FLUCTUATIONS OF THE BACE 



We have applied the above-mentioned analysis to the native structure of the BACE shown in Fig. (^). The 
associated normalised reduced covariance, computed through the /3-GM is shown in Fig. (jSJ. The comparison of the 
covariance matrix with the contact map of Fig. ^p) shows that the highest positive correlations are observed, as 
expected, in correspondence of contacting residues. More interesting is the presence of neg ative correlations which 
signal important mechanical couplings between regions that are not in spatial proximity l23 | . The inspection 

of Fig. (O reveals a significant degree of anti-correlation among residues on one lobe (especially 40-55 and 100-114) 
and those on the other one (especially 260-272 and 305-320). The fact that the conformational fluctuations of the two 
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FIG. 2: Reduced covariance matrices for BACE, as obtained from /3-Gaussian model computations. Entries with values close 
to I (strong positive correlation) are shown in blue, while anti-correlated pairs are shown in red. 




FIG. 3; Pictorial representation of the first three modes (from left to right) which describe the largest conformational distortions 
in the BACE. 



lobes are, on average, directed in opposite directions suggests that the enzymatic functional modes may be based on 
an opening/closing mechanism of the lobes. 

This insight can be considerably refined by examining the individual contribution of the various eigenvectors of A4 to 
the pair correlations. It is apparent from the decomposition of Eq. (|IO|l that the weight of the eigenvectors is inversely 
proportional to the corresponding eigenvalues. For this reasons we shall now describe the structural deformations 
encoded by the first three eigenvectors which, alone, are responsible for a good fraction of the overall residue mobility. 
By superimposing to the native structure the distortion associated to the first mode one can ascertain that it involves 
the movement of the lobes along opposite directions with respect to the plane identified by the cleft of the enzyme 
(each lobe moving almost rigidly while the connecting regions between the lobes is almost static). In the second mode 
the anti-correlated motion of the lobes is still visible but this time the motion occurs mostly parallel to the cleft plane, 
resulting in a shear deformation, as depicted in Fig. (|3J) while in the third mode, the two lobes appear to rotate in 
counter- directions. In all three modes the high mobility of the exposed loop ranging from Val309 to Asp3I7 is very 
noticeable. 

So far we have summarised the results obtained with the /?-GM which relics on several simplifying assumptions 
(including the use of the same friction coefficient for all residues, a limitation that will be removed later). It is 
therefore important to verify the validity of the conclusions reached here against independent terms of reference. 
Ideally, one would like to compare the model results against direct experimental determination of the quantities of 
interest here, such as the correlation of residues' motion. However, this is not presently feasible in a direct way and 
hence such detailed information can only be obtained from dynamical simulations with all-atom interaction potentials. 
We therefore carried out a MD simulation of the BACE in explicit solvent. The simulated system was constituted 
by the protein immersed in a water box of size 75 A x 87 A x 90 A to which 9 sodium counter-ions were added to 
ensure the overall charge neutrality. The whole system, composed of about 48,000 atoms, underwent 20 ns of MD 
simulatio ns ( after relaxation) carried out at 300 K with the GROMACS program [33L l33|. The Amber force-field 
(Parm98 f39|) was used to describe the protein and the counter-ions, whilst the TIP3P model was used for water. 
Particle mesh Ewald routines were used to treat long-range electrostatic interactions. A cut-off of 12 A was used for 
the Van der Waals interactions and the real part of the electrostatic interactions. 

We carried out the comparison of the /3-GM and molecular dynamics through a series of steps which include the 
comparison of the overall mobility of the various amino acids, of corresponding entries of the covariance matrices 
and of the largest eigenvectors of the covariance matrix. These comparisons are here exploited not only to check the 
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consistency of the two approaches, but also as a means to assign the /3-GM parameter k from the MD comparison. 

The first quantity that we shaU compare is the mean-square fluctuation which summarises the overall mobility of 
any given residue. Within the /3-GM this quantity is given by the diagonal elements of A^^^ 



(13) 



while, in the context of Molecular Dynamics, the thermodynamic average of Eq. H13|) is aptly replaced by the time 
average over the simulation run. It should also be noted that, usually, it is not easy to ascertain whether the 
simulated trajectory is sufficiently long that thermodynamic averages can be legitimately replaced with dynamical 
ones. A practical way of checking a posteriori the validity of this ergodicity assumption, is to check the consistency 
of the essential eigenspaces pertaining to different parts of the simulated trajectories. Indeed, this type of analysis 
shows that when the simulated time span greatly exceeds the typical time-scale associated to the slow dynamical 
modes then the essential subspaces are robustly identifiedls^. Besides this strategy, a recent theoretical study has 
introduced some valuable quantitative criteria by which it is possible to decide whether a given MD trajectory is too 
short to meet the ergodicity requirements f^. These criteria are, again based on the analysis of the essential dynamical 
spaces. When the dynamical sampling of phase-space is insufficient then the eigenvectors of the essential spaces have 
distinctive cosine-like shapes. By converse, it has been shown that when the cosine content of the relevant dynamical 
eigenspaces is negligible then the ergodicity assumption appears justifiedR|. In the present study we have adopted 
both these criteria: the total simulation time span of 20 ns is about thirty times bigger than the longest autocorrelation 
time found in the system (see last section); in addition the cosine content of the top 5 essential eigenvectors was, on 
average, below 10 %. 

It must be noticed that, within the f3-GM the mean square displacements are proportional to ksT/k. The compar- 
ison of the MD and model mean square displacements offers, therefore, the opportunity to set the effective value of k 
by, e.g. matching the average mean square displacements in the two cases. This criterion yields, for this particular 
protein, the value k^^ — 0.7 K? /Kb T. The profiles for the residues mobility according to MD and the /3-GM (having 
fixed k to the value mentioned above) are shown in Fig. Q). 
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FIG. 4: Mean square fluctuations of the various residues in the BACE obtained from MD and calculated from the /3-GM after 
an optimal choice of the harmonic coupling parameter, k. 



As mentioned before, Fig. © reveals the high mobility of the exposed loop spanning residues 309-317. Aside from 
this, the two profiles have a good degree of correlation since the linear correlation coefficient is 0.78 while the more 
stringent Kendall (non-parametric) coefficient one is 0.61 "37, "ss]. 

Having ascertained the consistency of the overall residues mobility of MD and /?-GM we analyze the agreement of 
the covariance matrices which convey information on pair correlations. We carry out the comparison in two stages: 
first by comparing corresponding entries of the matrices and subsequently by measuring the overlap between the 
significant subspaces of the matrices. 

The degree of accord of the normalized reduced covariance matrices of the MD and /3-GM is conveniently summarised 
through a scatter plot of corresponding entries of the two matrices. The results are summarised in Fig. (jSJ where we 
have deliberately omitted the diagonal entries of the matrix which are equal to 1 in both cases. Since the length of 
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the BACE approaches 400 residues, the data set shown in Fig. ((Sj) is constituted by more than 7.5 10^ distinct data 
points. The hnear correlation coefficient of the MD and (3-GM data is r = 0.83. This represent a strikingly high value 
given the huge number of degrees of freedom. However, a precise quantification of its statistical significance cannot be 
conveyed since the data are manifestly not distributed in a binormal fashion and the Kendall non-parametric analysis 
is inapplicable to to the excessive number of data points We conclude the discussion of the scatter plot of Fig. Q 
by mentioning that, in case of perfect correlation of two normalised (adimensional) covariance matrices the data would 
align along the diagonal of the graph in Fig. JS)). Interestingly, the best-fit line lies very close to the diagonal having 
a slope of s = 1.04, a fact which further testifies the overall viability of the Gaussian scheme. 
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FIG. 5: Scatter plot of corresponding entries of the reduced covariance matrices obtained by the /3-Gaussian model and MD 
simulations of BACE. The interpolating line, determined by linear regression, is also shown. 



As a final test we wish to measure the consistency of the most important eigenspaces of the two matrices, that is the 
eigenvectors associated to the largest eigenvalues of the covariance matrix (for /3-GM this is equivalent to considering 
the smallest eigenvalues oi Ai). In fact, these eigenspaces are the ones that describe the most significant modes 
of distortion of the molecule in the three-dimensional space. Various strategies have been proposed to evaluate the 
accord of the significant eigenspaces of the covariance matrices 0, 0, 113 ■ Here we have adopted the simple strategy 
of considering the top 10 eigenvectors obtained from MD and the /3-GM and calculating the modulus of the scalar 
product of any pair of vectors coming from the two sets. It is found that the overlap between the two essential 
eigenspace is considerable since about 75 % the norm of the first four eigenvectors of the /3-GM are projected on the 
first four eigenvector of MD. Overall the root mean square inner product suggested by Amadei et al. |36| among the 
two sets was 0.6 which denotes a non trivial correlation, although a precise assessment of its statistical significance is 
difficult to ascertain |^l3^. 

It is also interesting to evaluate the agreement of corresponding (i.e. equally ranking) eigenvalues of the covariance 
matrices. This analysis is carried out in the scatter plot of Fig. 10 where, for the /3-GM the ith largest ranking 
eigenvalue of the covariance matrix was taken as the inverse of the ith smallest eigenvalue of A^~^ multiplied by the 
previously found coefficient ksT/k. Since the eigenvalues span a few orders of magnitude, the plot is presented in 
a log-log format and reveals a relationship that rather than being simply linear is better described by a power-law 
dependence with exponent of about 1.4. 

After having characterize the accord of the conformational fluctuations occurring in thermal equilibrium within the 
all-atom MD and the (3-GM we focus on the characterization of the mechanical coupling between the BACE enzyme 
and the substrate (peptide) on which it acts. This analysis is accomplished by applying the Gaussian model not 
to the isolated enzyme but the enzyme/substrate complex shown in Fig. The crucial questions to formulate in 
this context regard the possible existence of non-trivial mechanical couplings between the substrate and residues in 
the enzyme. For other enzymes, in particular the HIV-1 protease, it has been shown that several residues of HIV-1 
protease despite being far away from the substrate have an important mechanical bearing on the latter UM K is by 
virtue of this mechanical coupling that it is possible to rationalize the emergence of mutations causing drug resistance 
in correspondence of sites far from the cleavage region (e.g. Met63Ile, Met46Ile-Leu and Leu47Val) 40, 41, 42] . In 
fact, as the explicit MD calculation has shown, the high degree of such coupling between such sites and the cleava ge i s 
such that the detailed chemical identity of the former strongly influence the substrate binding affinity of the latter|24l|. 
Therefore, mechanical couplings can be so important for enzymatic catalysis that mutations at a small number of key 
enzymatic sites (even if distant from the active site) can dramatically alter the enzyme reactivity. We have therefore 
undertaken a similar analysis here and calculated the degree of correlation of the displacement of residues in the 
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FIG. 6: Scatter plot of equally ranking eigenvalues of the MD and /3-GM covariance matrix. 




FIG. 7: The curve indicates the degree of correlation, calculated with /3-GM, of a residue in the middle of the inhibiting peptide 
and the 387 residues that constitute the BACE. 



substrate and all residues in the BACE. The correlation profiles are nearly identical for all residues in the substrate 
and a representative profile is shown in Fig. Q. 

From the profile a strong positive correlations is discernible in correspondence of residues 72 and 230 (numbering 
according to the PDB file). This reflects the close spatial proximity of the substrate and these contacting residues. The 
crucial feature is however contained in the peaks of negative correlations which occur in correspondence of residues 
23, 61, 83, 255, 269, 317. These residues are very distant from the substrate (25 - 30 A) and yet are able to influence 
the conformational fluctuations of the substrate bound to the active site. These sites are located at the outward face 
of each of the two lobes, in close analogy to the key mechanical sites found for HIV-1 protease. This fact provides 
additional evidence in support of the fact that the steering of the substrate towards a reactive conformation is achieved 
not only through flnely-tuned chemical couplings with the enzymes but also through subtle mechanical influence of 
sites far apart from the active region. 

We conclude our analysis by discussing the possibility to gain insight through the /3-GM on the time scales involved 
in the most important conformational fluctuations. To do so it is necessary to have a quantitative estimate for the 
effective friction coefficients appearing in Eq. © . It has been pointed out before 0, ^| that the effective friction 
coefficient experienced by the various amino acids in a protein depend not only on the interaction with the solvent 
but are severely affected by protein density itself in their neighbourhood. In particular, Hinsen et al p3 | have recently 
provided a phenomenological characterization of the viscous coefficient which they found to have an approximate 
linear relationship in terms of the local atomic density. We have translated this phenomenological relationship in 
terms of the number of contacts, to which residue i takes part (calculated with a cutoff distance of 7.5 A) and thus 
arrived to the phenomenological relationship 
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FIG. 8: Density plot for the modulus of the scalar product of the eigenvectors of matrix Ai (rank index given on x axis) against 
those of matrix Ml (rank index given on y axis). Entries with values close to 1 [0] are shown in black [white]. 



7, =i?< (14) 

where B = 1.53 10^ amu/ps. We have therefore calculated the various modes relaxation in this new context where the 
friction coefficient is not the same for all residues. Strikingly we have found that that there is a good correspondence 
between the modes of relaxation (which depend on 7^ and reflect dynamical properties) with the eigenvectors which 
describe conformational fluctuations in thermal equilibrium. This correspondence is obviously exact in the case when 
7i is the same for all sites but is not otherwise expected in case of heterogeneity of the friction coefficient. The 
degree of accord of the two sets of eigenvectors is shown in Fig. JHJ and indeed the thick cloud around the diagonal 
highlights their good correspondence. The time scales which controls the decay of conformational fluctuations in a 
dynamical trajectory are obtained from the inverse of the eigenvalues of A4 through eq. |^ Therefore, we obtain that 
the slowest relaxation time calculated within the Gaussian model has a decay time of 1.2 ns. The analysis of the decay 
of autocorrelation in the whole MD trajectory indicates 0.7 ns as the slowest relaxation time of the system. It appears 
therefore, that the simple Gaussian approach can correctly identify the time scales of the system autocorrelation 
within a factor of two. Therefore, despite the several approximations which are at the basis of the /3-GM the latter 
appears to be useful also for estimating, with a modest computational expenditure, the correct order of magnitude of 
the system autocorrelation time. This fact may be exploited to have a preliminary indication of the lapse of time that 
needs to be covered in an all-atom MD simulation to ensure that the system has sufficient time to explore significantly 
different regions of the phase space. 



II. CONCLUSIONS 



We have studied the conformational fluctuations of BACE with the purpose of clarifying the effect of conformational 
fluctuations on the capability of enzymes to steer the substrate in a favourable reactive conformations. In analogy 
with results obtained on a different class of enzymes I2M 0| also for this enzyme it is found that a few sites that 
are located remotely from the active site have a strong mechanical influence on the enzyme-substrate couplings. The 
analysis was carried out using a simple and computationally inexpensive Gaussian network scheme. Several aspects 
of the results obtained within the simplified coarse-grained model have been validated and tested against an extensive 
all-atom MD simulation. 



10 



Acknowledgements 

We are indebted to Paolo Carloni and Luca Marsella for several illuminating discussions and suggestions. We 
acknowledge support from INFM-Democritos and MIUR Cofin 2003. 



[1] Levitt, M.; Sander, C; Stern, P. S. J, Mol. Biol. 1985, 181, 423 447. 

[2] Horiuchi, T.; Go, N. Proteins: Structure Function and Genetics 1991, 10, 106-116. 

[3] ben Avraham, D. Physical Review B 1993, 47, 14559-14560. 

[4] Micheletti, C; Lattanzi, G.; Maritan, A. J. Mol. Biol. 2002, 321, 909-921. 

[5] Garcia- Viloca, M.; Gao, J.; Karplus, M.; Truhlar, D. G. Science 2004, 303, 186-195. 

[6] Karplus, M. Acc. Chern. Res. 2002, 35, 321-323. 

[7] Hess, B. Phys. Rev. E 2002, 65, 031910. 

[8] Tirion, M. M. Physical Review Letters 1996, 77, 1905-1908. 

[9] Noguti, T.; Go, N. Nature 1982, 296, 776-778. 
[10] Brooks, B.; Karplus, M. Proc. Natl. Acad. Sci. USA 1985, 82, 4995-4999. 
[11] Hinsen, K. Proteins: Structure Function and Genetics 1998, 33, 417-429. 
[12] McCammon, J. A.; Gelin, B. R.; Karplus, M.; Wolynes, P. G. Nature 1976, 262, 325-326. 
[13] Swaminathan, S.; Ichiye, T.; van Gusteren, W.; Karplus, M. Biochemistry 1982, 21, 5230-5241. 
[14] Bahar, I.; Atilgan, A. R.; Erman, B. Folding and Design 1997, 2, 173-181. 

[15] Bahar, 1.; Atilgan, A. R.; Dernirel, M. C.; Erman, B. Physical Review Letters 1998, 80, 2733-2736. 

[16] Jernigan, R. L.; Demirel, M. C.; Bahar, I. International Journal of Quantum Ghemistry 1999, 75, 301-312. 

[17] Doruker, P.; Atilgan, A.; Bahar, I. Proteins: Structure Function and Genetics 2000, 40, 512-524. 

[18] Atilgan, A. R.; Durell, S. R.; Jernigan, R. L.; Demirel, M. C.; Keskin, O.; Bahar, I. Bioph. J. 2001, 80, 505-515. 

[19] Micheletti, C.; Carloni, P.; Maritan, A. Proteins 2004, 55, 635-645. 

[20] Sinha, S.; Lieberburg, I. Proc. Natl. Acad. Sci. USA 1999, 96, 11049-11052. 

[21] Lin, X.; Koelsch, G.; Wu, S.; Downs, D.; Dashti, A.; Tang, J. Proc. Natl. Acad. Sci. USA 2000, 97, 1456-1460. 

[22] Hong, L.; Koelsch, G.; Lin, X.; Wu, S.; Terzyan, S.; Ghosh, A. K.; Zhang, X. C.; Tang. 3. Science 2000, 290, 150-153. 

[23] Plana, S.; Carloni, P.; Parrinello, M. J. Mol. Biol. 2002, 319, 567-583. 

[24] Piana, S.; Carloni, P.; Rothlisberger, U. Prot. Sci. 2002, 11, 2393-2402. 

[25] De Mori, G. M. S.; Micheletti, C; Colombo, G. J. Phys. Chern. 5 2004, 108, 12267-12270. 

[26] De Mori, G. M. S; Colombo, G.; Micheletti, C. Proteins 2004, m press. 

[27] Park, B.; Levitt, M. Proteins: Structure Function and Genetics 1996, 258, 367-392. 

[28] Hinsen, K.; Petrescu, A.; Dellerue, S.; Bellissent-Funel, M.; Kneller, G. R. Ghem. Phys. 2000, 261, 25-37. 
[29] Doi, M. Introduction To Polymer Physics; first ed., 1996. 

[30] Howard, J. Mechanics of motor proteins and the cytoskeleton; Sinauer Associates: Sunderland, MA, 2001. 
[31] Chandrasekhar, S. Rev. Mod. Phys. 1943, 15, 1-89. 

[32] Micheletti, C; Cecconi, F.; Flammini, A.; Maritan, A. Protein Science 2002, 11, 1878-1887. 
[33] Berendsen, H. J. C; van der Spoel, D.; Vandrunen, R. Gomp. Phys. Gomm. 1995, 91, 43-56. 
[34] Lindahl, E.; Hess, B.; van der Spoel, D. J. Mol. Mod. 2001, 7, 306-317. 

[35] Cornell, W. D.; Cieplak, P.; Bayly, C. I.; Gould, I. R.; Caldwell, J. W.; KoUman, P. A. J. Am. Ghem. Soc. 1995, 117, 
5179-5197. 

[36] Amadei, A.; Ceruso, M. A.; Nola, A. D. Proteins: Structure Function and Genetics 1999, 36, 419-424. 
[37] Halle, B. Proc. Natl. Acad. Set. USA 2002, 99, 1274-1279. 

[38] Press, W. H.; Teukolsky, S. A.; Vcttcrling, W. T.; Flannery, B. P. Numerical Recipes; CUP: Cambridge, 1999. 

[39] Amadei, A.; Linssen, A. B. M.; Berendsen, H. J. C. Proteins 1993, 17, 412-425. 

[40] Brown, A. J. L.; Korber, B. T.; Condra, J. H. AIDS Res. Hum. Retroviruses 1999, 15, 247-253. 

[41] Boucher, C. AIDS 1996, 10, S15-9. 

[42] Molla et al., A. Nat. Med. 1996, 2, 760-766. 



